IN THE CLAIMS:

1. (currently amended) A method for compressing an index file in an
information retrieval system that retrieves information from a plurality of documents, 
each of the plurality of documents having features occurring therein, the method 
comprising the step of: 

representing occurrence frequencies of the features in the plurality of documents
in a compressed format in the index file, wherein the compressed format comprises a
plurality of bin identifiers for a plurality of bins over which the occurrence frequencies
are categorized.

2. (original) The method of claim 1, wherein the features are textual. 

3. (original) The method of claim 1, wherein the features are non-textual. 

4. (currently amended) The method of claim 1, wherein said representing
step comprises the steps of: 

mapping the occurrence frequencies into the plurality of bins; and

storing the bin identifiers in the index file, each of the bin identifiers identifying
at least one of the bins to which at least one individual occurrence frequency is mapped.
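
Illustrative note (not part of the claim language): the mapping and storing steps of claim 4 might be sketched as follows. The boundary values, the nested dictionary layout of the index, and the names `bin_id` and `compress_index` are assumptions made for this example only; the claims do not prescribe them.

```python
from bisect import bisect_right

# Hypothetical bin boundaries, not taken from the claims.
# They define the bins [0, 2), [2, 5), [5, 10) and [10, infinity).
BOUNDARIES = [2, 5, 10]


def bin_id(frequency, boundaries=BOUNDARIES):
    """Return the identifier of the bin that contains `frequency`."""
    return bisect_right(boundaries, frequency)


def compress_index(index, boundaries=BOUNDARIES):
    """Replace each raw occurrence frequency with its bin identifier."""
    return {
        feature: {doc: bin_id(freq, boundaries) for doc, freq in postings.items()}
        for feature, postings in index.items()
    }


# Assumed layout: feature -> {document -> occurrence frequency}.
raw_index = {"bin": {"doc1": 1, "doc2": 7}, "index": {"doc1": 12}}
compressed = compress_index(raw_index)
```

With four bins, each stored identifier fits in two bits, which is where the space saving would come from in this sketch.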



5. (currently amended) The method of claim 4, further comprising the step of
assigning a range of values of occurrence frequencies to each bin.

6. (currently amended) The method of claim 4, wherein the step of
mapping results in one of the bins being empty.

7. (currently amended) The method of claim 5, wherein assigning comprises
assigning a different range of values of occurrence frequencies, such
that the different range represented by each of the plurality of bins
contains a substantially same number of the occurrence frequencies.
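
Illustrative note (not part of the claim language): one way to obtain ranges that each contain substantially the same number of occurrence frequencies is to cut the sorted frequencies at evenly spaced positions. The function name `equal_population_boundaries` and the sample data are hypothetical, and this quantile-style cut is only one possible reading of the claim.

```python
from bisect import bisect_right


def equal_population_boundaries(frequencies, num_bins):
    """Pick boundaries so each bin holds about the same number of values."""
    ordered = sorted(frequencies)
    n = len(ordered)
    # Boundary k is the value found k/num_bins of the way through the sorted list.
    cuts = [ordered[(k * n) // num_bins] for k in range(1, num_bins)]
    # Repeated values can produce duplicate cuts; keep each boundary once.
    return sorted(set(cuts))


freqs = [1, 1, 2, 3, 5, 8, 13, 21]
boundaries = equal_population_boundaries(freqs, 4)

# Count how many frequencies land in each resulting bin.
counts = [0] * (len(boundaries) + 1)
for f in freqs:
    counts[bisect_right(boundaries, f)] += 1
```

For this sample the four bins each receive two values. With heavily repeated frequencies the populations can only be approximately equal, which matches the "substantially same number" wording.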

8. (original) The method of claim 4, wherein said mapping step respectively 
maps more than a single term and a corresponding occurrence frequency into each of the 
plurality of bins, the method further comprises the step of scoring at least one of the 
plurality of documents with respect to a query, and said scoring step comprises the step of 
computing an occurrence frequency for a given one of the plurality of bins as a weighted 
average of the occurrence frequencies contained within the given one of the plurality of 
bins. 
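
Illustrative note (not part of the claim language): the weighted average of claim 8 could be computed as below. Weighting each distinct frequency by the number of times it occurs in the bin is an assumption; the claim does not state which weights are used, and the name `bin_representative` is hypothetical.

```python
from collections import Counter


def bin_representative(frequencies_in_bin):
    """Weighted average of the distinct frequencies in one bin.

    Each distinct value is weighted by how often it occurs in the bin
    (an assumed weighting; with these weights the result equals the mean).
    """
    counts = Counter(frequencies_in_bin)
    total = sum(counts.values())
    return sum(value * weight for value, weight in counts.items()) / total


# A bin holding the frequencies 2, 2, 3 and 4.
representative = bin_representative([2, 2, 3, 4])
```

At query time the scorer would use this single value in place of every frequency that was mapped to the bin.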

9. (original) The method of claim 4, wherein said mapping step respectively
maps only a single term and a corresponding occurrence frequency into each of the 
plurality of bins, the method further comprises the step of scoring at least one of the 
plurality of documents with respect to a query, and said scoring step comprises the step of 
computing an occurrence frequency for a given one of the plurality of bins based on the 
single term and corresponding occurrence frequency mapped thereto. 

10. (original) The method of claim 4, further comprising the step of 
establishing bin boundaries for the plurality of bins based on a methodology employed to 
score the plurality of documents with respect to queries, the bin boundaries defining 
intervals within which the occurrence frequencies fall. 

11. (original) The method of claim 4, further comprising the steps of:
receiving a query having at least one term; and 

computing a relevance score for at least one of the plurality of documents with 
respect to the query, based on the bin identifiers. 
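
Illustrative note (not part of the claim language): a relevance score based on bin identifiers might look like the sketch below. The per-bin representative frequencies, the additive scoring rule, and the name `relevance_score` are assumptions for the example; the claim leaves the scoring methodology open.

```python
# Hypothetical representative frequency for each bin identifier.
REPRESENTATIVE = {0: 1.0, 1: 3.0, 2: 7.0, 3: 15.0}


def relevance_score(query_terms, doc, compressed_index):
    """Sum the representative frequencies of the query terms found in `doc`."""
    score = 0.0
    for term in query_terms:
        bin_identifier = compressed_index.get(term, {}).get(doc)
        if bin_identifier is not None:
            score += REPRESENTATIVE[bin_identifier]
    return score


# Assumed layout: term -> {document -> bin identifier}.
compressed_index = {"bin": {"doc1": 0, "doc2": 2}, "index": {"doc1": 3}}
score_doc1 = relevance_score(["bin", "index"], "doc1", compressed_index)
score_doc2 = relevance_score(["bin", "index"], "doc2", compressed_index)
```

The scorer reads only bin identifiers from the index and never needs the raw occurrence frequencies.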

12. (original) The method of claim 4, wherein the method is implemented by a 
program storage device readable by machine, tangibly embodying a program of 
instructions executable by the machine to perform said method steps. 

13. (currently amended) An apparatus for compressing an index file in an
information retrieval system that retrieves information from a plurality of documents, 
each of the plurality of documents having features occurring therein, the apparatus 
comprising: 

a compression device for representing occurrence frequencies of the features in
the plurality of documents in a compressed format in the index file, wherein the
compressed format comprises a plurality of bin identifiers for a plurality of bins over
which the occurrence frequencies are categorized.

14. (original) The apparatus of claim 13, wherein the features are textual. 

15. (original) The apparatus of claim 13, wherein the features are non-textual. 

16. (currently amended) The apparatus of claim 13, wherein said compression 
device comprises: 

a bin generator for generating the plurality of bins and the corresponding
plurality of bin identifiers, each of the plurality of bin identifiers respectively identifying 
one of the plurality of bins to which at least one individual occurrence frequency is 
mapped; 

a mapping device for mapping the occurrence frequencies into the plurality of 
bins; and 

a storage device for storing the bin identifiers in the index file. 

17. (currently amended) The apparatus of claim 16, wherein said bin generator
assigns a range of values of occurrence frequencies to each one of the bins.

18. (currently amended) The apparatus of claim 17, wherein said bin
generator assigns a different range of values of occurrence frequencies, such
that the different range represented by each of the plurality of bins contains a
substantially same number of the occurrence frequencies.

19. (original) The apparatus of claim 16, wherein said mapping device 
respectively maps more than a single term and a corresponding occurrence frequency into 
each of the plurality of bins, the apparatus further comprises a scoring device for scoring 
at least one of the plurality of documents with respect to a query by computing an 
occurrence frequency for a given one of the plurality of bins as a weighted average of the 
occurrence frequencies contained within the given one of the plurality of bins. 

20. (original) The apparatus of claim 16, wherein said mapping device 
respectively maps only a single term and a corresponding occurrence frequency into each 
of the plurality of bins, the apparatus further comprises a scoring device for scoring at 
least one of the plurality of documents with respect to a query by computing an 
occurrence frequency for a given one of the plurality of bins based on the single term and
corresponding occurrence frequency mapped thereto. 

21. (original) The apparatus of claim 16, wherein said bin generator
establishes bin boundaries for the plurality of bins based on a methodology employed to 
score the plurality of documents with respect to queries, the bin boundaries defining 
intervals within which the occurrence frequencies fall. 

22. (original) The apparatus of claim 16, further comprising a scoring device 
for computing a relevance score for at least one of the plurality of documents with respect 
to a query, based on the bin identifiers. 

23. (original) A method for compressing an index file in an information 
retrieval system that retrieves information from a plurality of documents, each of the 
plurality of documents having features occurring therein, each of the features having 
parameters corresponding thereto, the method comprising the step of: 

mapping parameter values corresponding to the parameters of the features into a 
plurality of bins; and 

storing bin identifiers in the index file, each of the bin identifiers identifying a bin 
to which is assigned at least one individual parameter value corresponding to at least one 
individual parameter. 



24. (original) The method of claim 23, wherein the features are textual. 



25. (original) The method of claim 23, wherein the features are non-textual. 

26. (currently amended) The method of claim 23, further comprising the step
of assigning a range of parameter values to each one of the bins.

27. (currently amended) The method of claim 23, wherein the step of
mapping results in one of the bins being empty.

28. (currently amended) The method of claim 26, wherein assigning comprises
assigning a different range of parameter values, such that the different
range represented by each of the plurality of bins contains a
substantially same number of the parameter values.

29. (original) The method of claim 23, wherein said mapping step respectively 
maps more than a single parameter and a corresponding parameter value into each of the 
plurality of bins, the method further comprises the step of scoring at least one of the 
plurality of documents with respect to a query, and said scoring step comprises the step of
computing a parameter value for a given one of the plurality of bins as a weighted 
average of the parameter values contained within the given one of the plurality of bins. 

30. (original) The method of claim 23, wherein said mapping step respectively 
maps only a single parameter and a corresponding parameter value into each of the 
plurality of bins, the method further comprises the step of scoring at least one of the 
plurality of documents with respect to a query, and said scoring step comprises the step of 
computing a parameter value for a given one of the plurality of bins based on the single 
parameter and corresponding parameter value mapped thereto. 

31. (original) The method of claim 23, further comprising the step of
establishing bin boundaries for the plurality of bins based on a methodology employed to 
score the plurality of documents with respect to queries, the bin boundaries defining 
intervals within which the parameter values fall. 

32. (original) The method of claim 23, further comprising the steps of: 
receiving a query having at least one parameter; and 

computing a relevance score for at least one of the plurality of documents with 
respect to the query, based on the bin identifiers. 

33. (original) The method of claim 23, wherein the method is implemented by
a program storage device readable by machine, tangibly embodying a program of 
instructions executable by the machine to perform said method steps. 